智能论文笔记

Breast Cancer Classification Based on Histopathological Images Using a Deep Learning Capsule Network

Hayder A. Khikani , Naira Elazab , Ahmed Elgarayhi , Mohammed Elmogy , Mohammed Sallah

分类：计算机视觉

2022-08-01

乳腺癌是女性可能发生的最严重的癌症之一。通过分析组织学图像（HIS）来自动诊断乳腺癌对患者及其预后很重要。他的分类为临床医生提供了对疾病的准确了解，并使他们可以更有效地治疗患者。深度学习（DL）方法已成功地用于各种领域，尤其是医学成像，因为它们有能力自动提取功能。这项研究旨在使用他的乳腺癌对不同类型的乳腺癌进行分类。在这项研究中，我们提出了一个增强的胶囊网络，该网络使用RES2NET块和四个额外的卷积层提取多尺度特征。此外，由于使用了小的卷积内核和RES2NET块，因此所提出的方法具有较少的参数。结果，新方法的表现优于旧方法，因为它会自动学习最佳功能。测试结果表明该模型的表现优于先前的DL方法。

translated by 谷歌翻译

DiffTune$^+$: Hyperparameter-Free Auto-Tuning using Auto-Differentiation

Sheng Cheng , Lin Song , Minkyung Kim , Shenlong Wang , Naira Hovakimyan

分类：机器人

2022-12-06

Controller tuning is a vital step to ensure the controller delivers its designed performance. DiffTune has been proposed as an automatic tuning method that unrolls the dynamical system and controller into a computational graph and uses auto-differentiation to obtain the gradient for the controller's parameter update. However, DiffTune uses the vanilla gradient descent to iteratively update the parameter, in which the performance largely depends on the choice of the learning rate (as a hyperparameter). In this paper, we propose to use hyperparameter-free methods to update the controller parameters. We find the optimal parameter update by maximizing the loss reduction, where a predicted loss based on the approximated state and control is used for the maximization. Two methods are proposed to optimally update the parameters and are compared with related variants in simulations on a Dubin's car and a quadrotor. Simulation experiments show that the proposed first-order method outperforms the hyperparameter-based methods and is more robust than the second-order hyperparameter-free methods.

translated by 谷歌翻译

Safe Model-Free Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions

Yikun Cheng , Pan Zhao , Naira Hovakimyan

分类：机器人 | 机器学习

2022-11-30

Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient model-free RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with any model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.

translated by 谷歌翻译

Simultaneous Spatial and Temporal Assignment for Fast UAV Trajectory Optimization using Bilevel Optimization

Qianzhong Chen , Sheng Cheng , Naira Hovakimyan

分类：机器人

2022-11-29

In this paper, we propose a framework for fast trajectory planning for unmanned aerial vehicles (UAVs). Our framework is reformulated from an existing bilevel optimization, in which the lower-level problem solves for the optimal trajectory with a fixed time allocation, whereas the upper-level problem updates the time allocation using analytical gradients. The lower-level problem incorporates the safety-set constraints (in the form of inequality constraints) and is cast as a convex quadratic program (QP). Our formulation modifies the lower-level QP by excluding the inequality constraints for the safety sets, which significantly reduces the computation time. The safety-set constraints are moved to the upper-level problem, where the feasible waypoints are updated together with the time allocation using analytical gradients enabled by the OptNet. We validate our approach in simulations, where our method's computation time scales linearly with respect to the number of safety sets, in contrast to the state-of-the-art that scales exponentially.

translated by 谷歌翻译

Geometric Tracking Control of Omnidirectional Multirotors in the Presence of Rotor Dynamics

Hyungyu Lee , Sheng Cheng , Zhuohuan Wu , Naira Hovakimyan

分类：机器人

2022-09-20

全向多旋转器具有脱钩的转换和旋转运动的有利的可操作性，可以极大地取代传统的多电气运动能力。这样的可操作性需要全向多旋转器，才能经常改变推力振幅甚至方向，这是转子从转子自身动态引起的沉降时间的容易产生的。此外，在存在转子动力学的情况下，全向多动物在跟踪控制的稳定性尚未得到解决。为了解决此问题，我们提出了一个几何跟踪控制器，该控制器考虑了转子动力学。我们表明，所提出的控制器几乎呈指数稳定的误差动力学的零平衡。在模拟中验证了控制器的跟踪性能和稳定性。此外，已经执行了具有全向多动物的单轴力实验，以确认所提出的控制器在减轻现实世界中转子的沉降时间方面的性能。

translated by 谷歌翻译

DiffTune: Auto-Tuning through Auto-Differentiation

Sheng Cheng , Minkyung Kim , Lin Song , Zhuohuan Wu , Shenlong Wang , Naira Hovakimyan

分类：机器人

2022-09-20

机器人控制器的性能取决于其参数的选择，这需要仔细调整。在本文中，我们提出了DiFftune，这是一个新型，基于梯度的自动调整框架。我们的方法将动态系统和控制器作为计算图展开，并通过基于梯度的优化更新控制器参数。与常用的后传播方案不同，Difftune中的梯度是通过灵敏度传播获得的，这是一种与系统演化平行的前向模式自动分化技术。我们验证了杜宾汽车上提出的自动调节方法和在挑战性模拟环境中的四型旋转。仿真实验表明，该方法对于系统动力学和环境中的不确定性是可靠的，并且可以很好地推广到调整中看不见的轨迹。

translated by 谷歌翻译

Optimizing Crop Management with Reinforcement Learning and Imitation Learning

Ran Tao , Pan Zhao , Jing Wu , Nicolas F. Martin , Matthew T. Harrison , Carla Ferreira , Zahra Kalantari , Naira Hovakimyan

分类：人工智能 | 机器学习

2022-09-20

农作物管理，包括氮（N）受精和灌溉管理，对农作物产量，经济利润和环境产生了重大影响。尽管存在管理指南，但要在特定的种植环境和农作物中找到最佳的管理实践是挑战。先前的工作使用加强学习（RL）和作物模拟器来解决该问题，但是训练有素的政策要么具有有限的性能，要么在现实世界中不可部署。在本文中，我们提出了一种智能作物管理系统，该系统通过RL，模仿学习（IL）同时优化N受精和灌溉，并使用农业技术决策系统（DSSAT）进行了作物模拟。我们首先使用Deep RL，尤其是Deep Q-Network来培训需要从模拟器中的所有状态信息作为观测值（表示为完整观察）的管理政策。然后，我们援引IL来培训管理政策，这些政策只需要有限的国家信息，这些信息可以通过模仿以前的RL训练有素的政策在全面观察中轻松获得的国家（表示为部分观察）。我们在佛罗里达州使用玉米的案例研究进行实验，并将受过训练的政策与玉米管理指南进行比较。我们在全面观察和部分观察中训练有素的政策取得了更好的结果，从而获得更高的利润或类似的利润，而环境影响较小。此外，部分观察管理政策在使用易于使用的信息时直接在现实世界中部署。

translated by 谷歌翻译

Synergistic Redundancy: Towards Verifiable Safety for Autonomous Vehicles

Ayoosh Bansal , Simon Yu , Hunmin Kim , Bo Li , Naira Hovakimyan , Marco Caccamo , Lui Sha

分类：机器人 | 机器学习

2022-09-04

随着自动驾驶汽车（AV）开发的发展，对环境中乘客和代理商的安全性的担忧已经上升。涉及自主控制车辆的每个现实世界交通碰撞都使这种担忧加剧了。开源自主驾驶实现显示了具有复杂相互依赖任务的软件体系结构，这很大程度上依赖于机器学习和深层神经网络（DNN），这些任务容易受到非确定性故障和角落案例的影响。这些复杂的子系统共同履行AV的任务，同时还保持安全性。尽管在提高对这些系统的经验可靠性和信心方面正在做出重大改进，但DNN验证的固有局限性在提供AV中提供确定性安全保证方面却引起了无法克服的挑战。我们提出了协同冗余（SR），这是一种用于复杂网络物理系统的安全架构，例如AV。 SR通过将系统的任务和安全任务解耦来提供可验证的安全保证。在独立履行其主要角色的同时，部分功能多余的任务和安全任务能够相互帮助，从而协同改善合并的系统。协同安全层仅使用可验证且可分析的软件来完成其任务。与任务层的密切协调可以更轻松，更早地检测系统中的紧急故障。 SR简化了任务层的优化目标并改进了其设计。 SR提供了高性能的安全部署，尽管本质上无法验证的机器学习软件。在这项工作中，我们首先介绍SR体系结构的设计和功能，然后评估解决方案的功效，重点关注AV中障碍物存在故障的关键问题。

translated by 谷歌翻译

Verifiable Obstacle Detection

Ayoosh Bansal , Hunmin Kim , Simon Yu , Bo Li , Naira Hovakimyan , Marco Caccamo , Lui Sha

分类：机器人 | 计算机视觉

2022-08-30

对障碍的看法仍然是自动驾驶汽车的关键安全问题。现实世界中的碰撞表明，导致致命碰撞的自治缺陷源于障碍物的存在。开源自主驾驶实现显示了具有复杂相互依存的深神经网络的感知管道。这些网络无法完全验证，使其不适合安全至关重要的任务。在这项工作中，我们介绍了现有的基于LIDAR的经典障碍物检测算法的安全验证。我们对该障碍检测算法的功能建立了严格的界限。考虑到安全标准，这种界限允许确定可以可靠地满足标准的激光雷达传感器属性。对于基于神经网络的感知系统，此类分析尚未实现。我们对障碍检测系统进行了严格的分析，并基于现实世界传感器数据提供了经验结果。

translated by 谷歌翻译

HTML版本

Guaranteed Contraction Control in the Presence of Imperfectly Learned Dynamics

Pan Zhao , Ziyao Guo , Yikun Cheng , Aditya Gahlawat , Naira Hovakimyan

分类：机器学习

2021-12-15

本文提出了一种基于匹配不确定性的非线性系统的收缩指标和干扰估计的轨迹中心学习控制方法。该方法允许使用广泛的模型学习工具，包括深神经网络，以学习不确定的动态，同时仍然在整个学习阶段提供瞬态跟踪性能的保证，包括没有学习的特殊情况。在所提出的方法中，提出了一种扰动估计法，以估计不确定性的点值，具有预计估计误差限制（EEB）。学习的动态，估计的紊乱和EEB在强大的黎曼能量条件下并入，以计算控制法，即使学习模型较差，也能保证在整个学习阶段的所需轨迹对所需轨迹的指数趋同。另一方面，具有改进的精度，学习的模型可以在高级计划器中结合，以规划更好的性能，例如降低能耗和更短的旅行时间。建议的框架在平面Quadrotor导航示例上验证。

translated by 谷歌翻译